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Overview 


Teacher evaluation has undergone rapid reform over the past decade.? School systems set out to help strengthen and 
improve their existing evaluation systems, which were just not functioning effectively. Under past systems, almost all 
teachers were being rated as satisfactory and therefore largely viewed as interchangable.* 


Evaluation systems must be able to differentiate among different teachers’ effectiveness and apply that information in 
key personnel decisions in order to build the strongest possible teacher workforce. An examination of many states’ and 
school districts’ current approaches to evaluation shows that they have made some real progress in developing improved 
systems over the past decade. For example, many teacher evaluation systems historically included only two rating levels 
— satisfactory and unsatisfactory — with almost all teachers earning the former.* Today, most states’ and large districts’ 
evaluation systems consist of three or more rating levels. While previous ratings were based primarily on observations 
of the teacher by a principal, more evaluation systems now factor in objective measures of student growth, as well as 
other measures such as student input in the form of surveys. Whereas many school systems once routinely evaluated 
tenured teachers on a two- to fiveyear cycle, approximately half of the 100 largest districts and half of al states now require 
annual, summative evaluations for all teachers and have policies to use the results of evaluation systems to inform teacher 
compensation.3® 


Despite this clear progress, strengthening evaluation systems so that they reflect the genuine distribution of teacher talent 
in a district or state has proven challenging. One of the setbacks was the implementation of new teacher evaluation measures 
in tandem with new college- and career-ready standards and assessments for students. These simultaneous transitions 
meant that teachers were making significant changes in what they taught at the same time they were being more rigorously 
evaluated, causing some teachers to bristle at these policy changes. 


Fortunately, we can identify six pioneers — four districts and two states — which are reporting genuine successes. They 
are implementing many of the same components commonly found in many state and district systems; however, their 
results are setting them apart. 


These six systems are distinguishing among teachers of varying quality and are delivering the changes initially sought 
by districts and states across the nation. Some of these systems are already able to boast measurable evidence that 
the teacher workforce is improving, with higher-performing teachers staying longer and weaker teachers, who previously 
might never have even known they were weak, choosing to leave. 


This analysis focuses on these six systems: Dallas Independent School District, District of Columbia Public Schools, Denver 
Public Schools, Newark Public Schools, New Mexico, and Tennessee. It depicts how evaluation systems can benefit teachers 
and, most importantly, students. 


The districts and states discussed can each report publicly available evidence of the positive impacts of their evaluation 
systems, Although the specific successes observable in the districts and states profiled here differ, each offers examples 
of progress that is possible under welldesigned and wellimplemented teacher evaluation systems which achieve a more 
honest distribution of teacher talent and which emphasize continuous improvement. The case studies are drawn from 
official policies, reviews of the evaluation system that these states and districts have conducted, independent studies, and 
interviews with district and state staff. 
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Goals of teacher evaluation 


Strong teacher evaluation systems, when paired with supports and incentives, are designed to do the following: 
1) Provide a more valid measure of teacher quality by distinguishing between teachers at different performance levels; 
2) Recognize strong teachers and keep them in the classroom; 
3) Encourage consistently less effective teachers to leave the classroom; 
4) Help all teachers improve; 
5) Recruit more effective new teachers; and 
6) Achieve gains in student learning and other positive student outcomes. 


What these six systems have in common 


The knowledge base for building a strong evaluation system is stil young and is continuously being refined, but some clear 
principles of strong practice have emerged from a decade of innovation and implementation (see the appendix for more 
details on these practices and supporting research). As is immediately evident, the districts and states highlighted here 
have each implemented many of these strong practices. 


Their success is a result of adherence to core principles’ 
Dallas 


District of 
Independent Denver Public Columbia Public Newark 
Strong Practice School District Schools Schools Public Schools Tennessee New Mexico 


Student surveys 
At least three rating categories v v v ve v v 


Professional development 
tied to evaluation cA v a cs v v 


Although 89 of the One notable feature common among the six locales highlighted here is the use 
100 | distri cof muitiple measures to comprise the overall evaluation rating. The systems 

00 largest districts vary in the precise measures used, but each uses at least three measures 
use multiple measures, | and includes some measures of student learning, as well as observations 


only 21 use or allow and, in many cases, student surveys. 
student surveys.!° 
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Their success is built on a thoughtful approach to weighting individual evaluation components 


Weight of individual Dallas District of 
components Independent Denver Public Columbia Public Newark 
(vary by grade and subject) School District Schools Schools Public Schools"? Tennessee 
tag 
Observations! 50.80% 30.35% 30-75% S5eoK? aaa 
‘student surveys) 
Student achievernent or growth 20-35% 50% 15-50% 15-45% 50% 
Student surveys 015% 010% 010% 0% See “Observations” 
Professionalism 0% 10-15% 0% 0% 0% 
Commitment to the school 
anny 0% 0% 10% 0% 0% 
Teacher attendance 0% 0% 0% 0% 0% 


Evaluation systems can be an essential part of a district's or state's talent management strategy, not only to give teachers 
feedback and support, but also to inform myriad decisions such as eligibility for leadership roles, raises, or retention in 


the classroom. 


Their success is made possible by linking evaluation to key personnel decisions 


Dallas District of 
Independent Denver Public Columbia Public 
Key decisions School District Schools Schools 
Ties compensation to 
evaluations we SA % 
Selects cooperating 
teachers to host student 
teachers based on 4 a, 
evaluations 
Selects teachers for 
leadership opportunities Vv v v 
based on evaluations 
Tailors professional 
development based on v vf v 
evaluations 
Makes teacher dismissal 
decisions based on v v v 
evaluations 
Incentivizes effective 
teachers to work in v v v 
highneed schools 
Other Identifies Earning or Adapts 
effective losing non- evaluation 
teachers to probationary rubric to 
teach summer (or tenure) _—_assess teacher 
school status! applicants during 
hing process 


Newark 
Public Schools 


v 


Tennessee 


v 


v 


Informs teacher 


preparation program 
‘accountability; influences 
licensure advancement 
and renewal decisions; 
determines Ifa new 
teacher qualifies for 
tenure; informs layoff 


decisions 


New Mexico 


40% 


35% 
5% 
15% 


0% 
5% 


New Mexico 


v 


Informs teacher 
preparation 
program 
accountability; 
influences 
licensure 
advancement 
and renewal 
decisions; 
informs selection 
of the state 
Teacher of 

the Year 
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Conversations with district and state staff in these six sites emphasized that attaching meaningful consequences to evaluation 
systems encourages teachers and principals to take them seriously. Teachers were more inclined to seek support, and 
principals were more motivated to have difficult conversations with their staff. While teachers continue to earn bonuses or 
raises for many reasons, compensation tied to positive ratings on evaluations is increasingly common. 


Their success is fueled by making strategic compensation decisions 


Dallas, District of 

Independent Denver Public Columbia Public Newark 

School District Schools Schools Public Schools. Tennessee New Mexico 
Teaching ahardtostaff subject Bonus Bonus'* Bonus Raise Award 
Costotiving adjustment Raise Raise 
Earning an advanced degree Raise Raise Raise 


With the exception of Denver Public Schools, whose superintendent has remained in place since its evaluation system was 
first piloted, every system featured here has survived changes in leadership. While these systems continue to evolve and 
improve, they have all maintained their core principles. 


The evaluation systems featured here include many of the characteristics that research and common sense suggest will 
yield the greatest benefits for teachers, school systems, and students. However, the staff implementing these systems 
‘emphasized that they have only reached this point because of a consistent commitment to assessing and improving their 
systems. None of these evaluation systems got everything right in the first year of implementation. System leaders gathered 
feedback from teachers, principals, and other stakeholders; analyzed the data they had gathered; and used this information 
to identify weak points and refine their systems. This systemic commitment to continuous improvement persevered despite 
leadership transitions experienced by each of the districts and states profiled here. 


Evaluation is not a silver bullet that will automatically improve teacher effectiveness and student outcomes. But as the 
following six examples illustrate, with careful implementation and a commitment to build upon what works and remedy 
what does not, an evaluation system can be an essential tool in state and district efforts to ensure that every student has 
access to effective teachers. 
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Dallas Independent School District (DISD) 


Evaluation System: Teacher Excellence Initiative (TEI) 


History: Implemented in 2014 under Superintendent Mike Miles and continues under Superintendent 
Michael Hinojosa. 

How it works: Teachers earn one of seven evaluation ratings that are then combined with other criteria (e.¢., 
previous years ratings, additional evaluation processes) which determines teachers’ pay levels. 

Impact on salary: Maximum: $100,000 ($90,000 in salary, plus up to $10,000 in bonuses}, achievable in as few 
as six years. 


Maximum prior to evaluation system implementation: $87,338, achievable in 37 years with a Ph.D. 


REFLECTING ON SYSTEM 
Notable features 
1 The district sets a target distribution for the percentage of teachers Dallas is “able to identify our 


earning each rating. This policy forestalls the upward creep in ratings fs 

that may not be covtelated with achievement an alow the dsinct_-'™OSt effective teachers... 

to budget responsibly. [and we see that] retention of 

While all teachers are evaluated on core aspects of their job (classroom  Qur most effective teachers is 
formance, students’ achievement, students’ classroom experiences), 

teachers mst go through an additonal review to enhanc a strength of the system.” 

rating levels. Only teachers who are in the top 30 percent of their — Suzy Smith, Monager of 

peer group”, are in at least their third year of service, and meet Performance Management?” 

other performance criteria can apply to be in these elite categories.22 


All teachers are formally observed several times a year, but the district also requires short “spot observations,” 
which can save principals time and give them more realtime opportunities to engage with teachers and learn about 
their strengths and areas for growth.”® 


To provide more salary stability and protect teachers from an anomalous bad year, Dallas averages a teacher's current 
and previous year’s ratings to determine salary. Also, teachers’ salaries only drop if they have earned a lower effectiveness 
level for three consecutive years. 


Teachers who earn one of the lowest two rating categories do not receive a raise. 


The Dallas evaluation system has a targeted distribution for all of its evaluation ratings, meaning that the district 
limits the number of teachers who fall into one of the ratings categories it has created. No more than 3 percent of 
teachers can receive the lowest rating, and no more than 2 percent can receive the highest rating. 


Opportunities for improvement 


= The system involves two different rating systems: one to measure performance in the most recent school year and a 
more cumulative system that incorporates annual ratings and other data to determine pay. These two systems have 
similar names, which has caused confusion. 

= In a 2017 survey, roughly a quarter of teachers felt the evaluation system was unfair to teachers facing greater 
challenges in loweperforming schools.** 


Notable changes to system since inception 


= When this system first started, salaries were based entirely on performance. Now, Dallas offers retention bonuses (which 
grow as teachers earn higher ratings) so that teachers earning all but the lowest ratings can see their compensation 
increase each year. 


= The district plans to reduce the number of possible compensation levels, as well as increase the current cap on the 
percentage of teachers who can qualify for the highest rating. 
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BUILDING BUY-IN 


Steps to build buy-in 
The district gathers feedback on the evaluation rubric from teachers, principals, and other stakeholders and calibrates, 
it through campus walks and data analysis.* Dallas also recruits teachers on each campus to attend regular evaluation 
system trainings and to share this information with their colleagues. 


Surveys of teachers and administrators 
Ina 2017 survey, only 41 percent of teachers were satisfied with the system, but this is an improvement from 2014- 
2015 when only 31 percent were satisfied.” Despite this level of dissatisfaction, many teachers like the individual 
pieces of the system. About three quarters of teachers found each aspect of the evaluation (spot observations, 
extended observations, and summative evaluation feedback) helpful.2® 


A 2017 survey found that administrators generally iked the system (72 percent were satisfied).22 


EVIDENCE OF IMPACT 


Recognizes strong teachers and keeps them in the classroom? Because of ongoing calibration 
Yes, 98 percent of the highest rated teachers stayed in the district training for evaluators, “We 


after the 2016-2017 school year.*° Poids GB GISTISIEAL IS DERG 
Encourages consistently less effective teachers to now that proncient 's proncient, 
leave the classroom? regardless of campus, content 
Yes, 50 percent of consistently unsatisfactory teachers left the district grea, or grade level.” 
after the 2016-2017 school year. rel 
Helps all teachers improve? 
Yes, teachers’ average spot observation and overall evaluation 
scores are rising, suggesting that teachers may be performing 
better over time (although score inflation could also cause these 
increases) 2 


Correlates with increased student achievement? 
Yes, student proficiency across all subjects and grades (already on a modest upswing) increased more steeply by 
7 percentage points from 2015 to 2017, closing the proficiency gap between Dallas and the state of Texas by 3 
points.2? Note that these gains cannot be tied directly to the evaluation system without a controlled study. 


Other evidence of impact? 
Dallas focused resources on a set of schools identified as "Improvement Required” by the state education agency. These 
resources focus heavily on strategic staffing, including using incentives to attract effective teachers to work in these 
schools. Since the program's inception, these schools have improved in both student performance and attendance.* 
Dallas credits the drop in the number of designated low-performing schools (from 43 in the 2013-2014 school year to 
only four for the upcoming 2018-2019 school year) largely to its evaluation system.2® 


— Suzy Smith, Manager of 
Performance Management 
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Denver Public Schools (DPS) 
Evaluation System: Leading Effective Academic Practice (LEAP) 


History: Piloted in 2012-2013" and full implemented in 2013-2014 under Superintendent Tom Boasberg, 
who continues to lead the district in 2018, although he recently announced that he will resign 
later this year. 


How it works: Teachers earn one of four ratings. 


Impact on salary: Maximum: Undefined. The salary is comprised of base pay and incentive pay for a number 
of factors (e.g., working in a highneed school or teaching a hard-to-staff subject, receiving a 
satisfactory evaluation rating). Teachers’ future base earnings potential is currently not limited 
as there is no salary cap on the current step and grade schedule. 


Maximum prior to evaluation system implementation: $74,218 with a Ph.D. and 12 years of 
experience. 


REFLECTING ON SYSTEM 
Notable features 


1 Teachers who earn effective ratings are eligible to apply for arange The system is “now integrated 
of teacher leadership positions, which allows them to take on a 


leadership role in their school (and earn up to $5,000 on top of into how we think about 
their salaries) without leaving the classroom. These teacher leaders growth and development, 


apply the district's Framework for Effective Teaching when coaching 
and evaluating their peers. To read more about how Denver Publ OW We look at and leverage 
Schools, a Great District for Great Teachers, approaches teacher + renewal decisions, and how 


leadership, visit here. we provide support for new 
The district's research finds that early career teachers tend to improve 4 
faster when they are coached on a narrow set of skills and receive teachers and make promotion 
direct, bitesized feedback on those specific skils, rather than trying Gecisions for leadership 

to develop skils in many areas at once. Using ths information, Denver positions.” 

has developed a New Teacher Playbook for early career educators. The 

playbook prioritizes specific skils and provides guidance intended to 
help teachers make quick, measurable improvement"? 


Principals and their instructional leadership teams can take advantage 
of optional peer observations. They are paired with an evaluation system 
specialist who helps school leaders become more consistent in 
observing teachers and providing highteverage feedback. in addition, 
central partners help instructional leadership teams analyze student 
data to ensure that teachers’ growth translates into accelerated student 
growth. 


Denver requires all observers to complete and pass an initial observation certification process. All observers also 
engage in an annual schooLbased calibration session with an evaluation system specialist, which allows the district to 
identify and support observers whose feedback and ratings may be misaligned. 


In 2015, the district launched incentives for teachers in 30 of its highestpriority schools based on recommendations 
from teachers. Teachers receive a monthly incentive for working in a highestpriority school and a yearly retention incentive 
for returning to these schools, with higherated teachers earning more. Teachers in the highestrating category can 
earn up to $4,000, in addition to other incentives and base pay, for working and staying in a highest priority school. 


— Nicole Wolden, Director of 
Growth and Performance 


= The district views its use of the framework as being pervasive across the district, building the foundation for how it 
defines excellence in teaching.” 
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Opportunities for improvement 


= The district has several mechanisms to identify areas for improvement of the evaluation system and to strengthen 
those areas. The district regularly engages teachers in these efforts and has created a collaborative committee in 
partnership with the teachers union.2 Denver also seeks to build teachers’ understanding of and confidence in how the 
system can be used to help them improve. 


Notable changes to system since inception 


= Based on feedback from teachers, in some circumstances", Denver now allows instructional leaders to incorporate their 
‘own judgment (based on a holistic analysis of a teacher's performance) into a teacher's final evaluation rating. 


1 Informed by feedback from teachers, the district changed the scoring model for how student perception data factor into 
teacher evaluations. Denver now compares survey data across subsets of teachers, rather than across all teachers in 
elementary schools or all teachers in secondary schools. For example, student perceptions of elementary homeroom 
teachers are compared to other elementary homeroom teachers, but not to elementary specials teachers (e.g., at teachers). 


= The district created content-specific framework appendices to ensure that the system is adaptable to teachers working 
with students across various grades, contents, and contexts. 


BUILDING BUY-IN 


Steps to build buy-in 
The district provides training on the evaluation system for all new teachers, focusing on the evaluation system's role 
in helping teachers grow in their practice. 


Surveys of teachers and administrators 
On the most recent district survey, more than 80 percent of teachers reported that they felt comfortable having 
honest conversations about their performance with their coaches and believed that their coaches had an accurate 
understanding of their effectiveness. Two-thirds of teachers also reported that their experience with the evaluation 
system helped improve their teaching practice.* 


Aimost 100 percent of school leaders and Senior/Team Leaders responding to the survey reported that they knew 


what was expected of them to implement LEAP as a growth tool. Nearly 90 percent also reported that the culture in 
their buildings supports honest assessments of teaching performance. "= 


EVIDENCE OF IMPACT 
Provides a more valid measure of teacher quality? 
Yes, in 2017-2018, 24 percent of teachers were rated Distinguished, When an evaluation system has 
64 percent Effective, 12 percent Approaching, andless than 1 percent 4: 
were rated Not Meeting."© high consequences attached, 
Teachers who earn the top two rating categories have significantly “Mw do you take the fear out, 
higher student growth averages than teachers with lower effectiveness and create a system in which 


ratings. both coaching and evaluation 
Recognizes strong teachers and keeps them in the classroom? ji, j om 

Yes, 91 percent of the highestrated teachers were retanedin the |'V@ in the same place? 

district.” — Nicole Wolden, Director of 
Encourages consistently less effective teachers to Growth and Performance®* 


leave the classroom? 
Yes, 37 percent of teachers in the lowest two rating categories in 
2016-2017 were not retained in Denver for the following school 
year (80 percent of those in the lowest category and 35 of those in 
the nextlowest category left)."® 


Helps all teachers improve? 
Yes, 92 percent of teachers with evaluation ratings in 2016-2017 and 2017-2018 received either the same rating 
or improved one rating category, with many teachers receiving the same rating having shown improvement in the 
underlying components of the framework? (although score inflation could also cause these increases). 
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Recruits more effective new teachers? 
Yes, newly hired teachers were on average more effective than exiting teachers in mathematics and were more 
effective in ELA in most years as well. 


Correlates with increased student achievement? 
Yes, in the years since Denver has implemented the evaluation system, the district has experienced consistent student 
growth as measured by state assessments. Denver Public Schools’ students have consistently outpaced their classmates 
‘Statewide in academic growth in English language arts and math.®' Note that these gains cannot be tied directly to the 
evaluation system without a controlled study. 


District of Columbia Public Schools (DCPS) 
Evaluation System: IMPACT 


History: Implemented in 2009 under Chancellor Michelle Rhee and continued under Chancellor Kaya 
Henderson (2010-2016) and then Chancellor Antwan Wilson (2017-5 2018). Now under Interim 
Chancellor Amanda Alexander. 


How it works: Teachers earn one of five ratings. 

Impact on salary: Maximum: $139,126 (base salary of $114,126 pis up to $25,000 in bonuses based on teaching 
assignment) with at least nine years of 

REFLECTING ON SYSTEM 


Notable features 


. The compensation associated with A MPACT is substantial andis the “Teachers who come to DCPS 
largest of any district or state in this analysis. Teachers can earn : : 
up to $25,000 in bonuses annually f they earn the highest of fve  Tecognize what IMPACT is, 
ratings, teach in a targeted high-poverty school, and meet other cr- they come knowing that those 
teria. Teachers who consistently eam high ratings can also advance ‘ 
more quickly up the salary schedule. For example, teachers in af the expectations, and they 
high-poverty schools who have earned the top rating for six or more are exCited to meet them. 
years in a row are paid as though they actually had an additional 12 
years of experience.*” So a seventh-year teacher who consistently 
earned these top ratings is paid as though she is in her 19th year 
and is moved into the Ph.D. lane of the salary schedule, receiving a 
base salary increase of nearly $56,000, in addition to any bonuses 
she earns. 


Whereas some districts have struggled with using evaluations for both teacher accountability and improvement, 
DCPS has created a separate system to help teachers improve, giving them feedback without high stakes attached. 
The district's professional development program partners teachers with content experts for a cycle of learning, 
practicing, and receiving feedback. To read more about how DCPS, an Outstanding District for Great Teachers, 
approaches professional development, visit here. 


DCPS provides an online video library with examples of the district's five “essential practices” of good teaching, in 
addition to online curricula resources for all subjects. 


— Betsy Press, former 
Deputy Chief of Impact”? 


To build accuracy in observations, the district requires new principals to go through four hours of online training, in 
which they watch online videos of instruction and rate teachers to ensure that their ratings are consistent with anchor 
ratings. 
Opportunities for improvement 
= A higher percentage of teachers in low-poverty schools receive the highest evaluation rating, while fewer teachers 
in high-poverty schools receive the highest rating.” There is more work to be done in unpacking the root causes of 
these differences. 
Notable changes to system since inception 


= DCPS now counts individual value-added measures as up to 35 percent of a teacher's evaluation rating, whereas these 
used to count for up to 50 percent. 


The district revised its observation rubric so that itis more flexible and streamlined. 
= DCPS added student surveys for grades 3 to 12, which now comprise 10 percent of teachers’ evaluation ratings. 
The district ended the use of third-party observers (known as Master Educators) in the 2015-2016 school year. 
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BUILDING BUY-IN 


Steps to build buy-in 
To make sure teachers understand the evaluation system, DCPS offers IMPACT guidebooks customized for teachers 
of different grades and subjects. Rather than ask teachers to come to a centralized location, the district provides 
training in individual schools, which makes it easier for all teachers to learn about the system. The district also provides 
principals with resources to facilitate their own overview of the evaluation system and introduces all new teachers to 
IMPACT at the New Teacher Orientation. The IMPACT team is available to answer questions and address teacher and 
administrator concerns, and responds to inquiries within one business day. 


EVIDENCE OF IMPACT 


Provides a more valid measure of teacher quality? “While the district values 
Yes, in 20162017, 20 percent of teachers received a rating below 7 
effective, and 80 percent received one of the top two ratings." stability, we are constantly 
Recognizes strong teachers and keeps them in the classroom? iterating, reflecting, and 


Yes, DCPS retains 92 percent of its effective and highly effective always keeping on top of 


° : ‘. 
bea the ways we can improve. 

While IMPACT was the top reason given for leaving by 22 percent of 

the lower-performing teachers who are no longer DCPS teachers, — Betsy Press, former 


only 6 percent of higher-performing teachers cited it as their top Deputy Chief of Impact” 
reason for leaving. ee 


Encourages consistently less effective teachers to 
leave the classroom? 
Yes, since the implementation of IMPACT, lowperforming teach- 
ers are three times more likely to leave DCPS than high-performing 
teachers, and account for more than a third of teachers exiting DCPS.** 


Helps all teachers improve? 
Yes, lower-performing teachers who stayed in the system but were facing threat of dismissal due to a low evaluation 
rating improved their performance on average. Higher-performing teachers who were near the cut point for a financial 
incentive also improved. 


Recruits more effective new teachers? 
Yes, one study found that the incoming teachers increased student achievement by 0.08 SD in math, compared to 
the teachers who left. Another found that during the first two years of implementation, new hires earned higher ratings 
on the evaluation system than exiting teachers.°” 


Correlates with increased student achievement? 
Yes, since 2009, DCPS has made significant gains on the NAEP assessment in 4th and 8th grade math and reading, a 
though scores declined slightly in math in the most recent NAEP assessment. The white-black student achievement 
gap has also decreased in 4th grade math and reading since 2009. Note that these gains cannot be tied directly 
to the evaluation system without a controlled study. 


ere Bn! 


= ; 6 Lab Eyer > wid ip. 
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Newark Public Schools (NPS)”* 


Evaluation System: Framework for Effective Teaching 


History: Implemented in 2012 under Superintendent Cami Anderson, continued under Superintendent 
Christopher Cerf through 2018, and continues now under new Superintendent Roger Leén, 

How it works: Newark’s evaluation system has four rating categories. 

Impact on salary: Maximum: $100,531 ($95,531 maximum base salary plus up to $5,000 in bonuses), achievable 


after 18 years. 


Maximum prior to evaluation system implementation: $103,159, achievable in 29 years with 
a Ph.D, a masters and 30 credits, or a bachelors with 60 credits. 


REFLECTING ON SYSTEM 
Notable features 


= The district gave the evaluation system a chance to work. While The evaluation system is 
Newark saw students’ achievement decline initially after the new w, 
evaluation system was implemented, the district persevered and more than a way to put — 
student achievement rose to the level it had been before and, in teachers into buckets; it is 


English, exceeded previous levels. for coaching, feedback and 


Newark’s central office creates monthly reports that serve the growth, and sometimes for 
dual purposes of providing principals with data on their progress : 


completing evaluations and tracking the integrity of their ratings @xiting or promoting teachers.” 
by showing their schools’ teacher ratings distribution compared to — Larisa Shambaugh, former 
other schools’ ratings distribution. These reports allow principals, Chief Talent Officer 
their supervisors, and the central office staff to examine the same 

data and have conversations around these data. 


Newark invested substantial resources when the system was first implemented to train principals on what the 
language of the evaluation framework meant, what to look for when observing teachers, and how to reliably rate 
teachers. Now, the district maintains this consistency by holding a brief retraining every summer. 


= Newark offers teachers a rebuttal process if they disagree with their evaluation rating. 


Opportunities for improvement 
= While Newark considers multiple measures in its teacher preparation ratings, it does not use student surveys. 


Student surveys provide another layer of understanding about a teacher's performance and often align with other 
measures of teacher effectiveness. 


Notable changes to system since inception 


= While Newark initially intended to build a video library of best practices, instead they are providing guidance for 
teachers to videotape themselves and share with their peers in a community of practice.”> 


BUILDING BUY-IN 


Steps to build buy-in 4 i tri 
Teachers can provide taciback theousl severe panelsabd adlcoty OEY district that says, 
committees and have a rebuttal process if they disagree with their final ‘We nailed that,’ is not telling 
observation rating.” Also, much of the professional development is the truth — we could always 
co-developed by teachers.”® d 7 

Surveys of teachers and administrators Stn TE: 

According to a 2015 survey, most teachers perceive the evaluation — Larisa Shambaugh, former 
system as valid, accurate, fair, and useful, although support for the Chief Talent Officer‘ 
differentiated compensation system aligned with the evaluation system 

is more mixed. 
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In the same 2015 survey, nearly all school leaders believe that all components of the evaluation system are valid and 


that the evaluations are an accurate measure of teacher performance.”” 


DENCE OF IMPACT 


Provides a more valid measure of teacher quality? 


Re 


Yes, the most recent available ratings distribution found that 4 percent 
of teachers were ineffective, 10 percent were partially effective, 76 
percent were effective, and 11 percent were highly effective,”® 

cognizes strong teachers and keeps them in the classroom? 
Yes, the retention rate for highly effective teachers is 96 percent in 
the fifth year of the evaluation system.” 


Encourages consistently less effective teachers to 
leave the classroom? 


Cor 


Yes, the attrition rate for ineffective teachers is 49 percent in the 
fifth year of implementation. 


relates with increased student achievement? 
Partially. Since implementing these reforms, Newark Public Schools’ 
students are growing at a faster rate in English language arts compared 
with their peers across New Jersey, but there is no significant net 
change in math (after student achievement growth recovered from 
an initial dip in both subjects).® However, much of the gain in English, 
as well as the growth in math after the initial dip, appears to be 
driven by the district's decision to move students out of lower- to 
higher-achieving schools; the teacher evaluation system itself is 
likely not entirely responsible for the growth.22 


‘When ratings go up, we have 
to ask, are we getting better, 
or are we inflating? You need 
to look at other data to keep 
asking yourself that question 
in different ways. And in the 
end, do we see an increase in 
student learning?” 


— Larisa Shambaugh, former 
Chief Talent Officer®> 


Newark’s schools have made gains in closing the gap between the district's student performance on standardized tests 


and the state average. Note that these gains cannot be tied directly to the 


Other evidence of impact? 
Yes, the district has a higher student enrollment now than at any other time in its recent history, a sign of renewed 


confidence in Newark schools.®? 
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evaluation system without a controlled study. 


New Mexico Public Education Department (NMPED) 
Evaluation System: NMTEACH 


History: Implemented in the 2013-2014 school year under Governor Susana Martinez and State Education 
Secretary Hanna Skandera and continues now under Governor Susana Martinez and State Education 
Secretary Christopher Ruszkowski. 

How it works: Teachers earn one of five evaluation ratings. 

Impact on salary: New Mexico gives local districts the authority to set pay scales and therefore any connection between 
teacher salaries and this evaluation system must be decided by each individual district. The state 
mandates a minimum starting salary, but districts determine the remainder of the schedule. 


REFLECTING ON SYSTEM 
Notable features 


1 Requires multiple objective measures: Among all 50 states and “The impact of including 
the District of Columbia, New Mexico is the only state in which the 

ieacher evaluation system ariculates speciic percenlage requrements teacher attendance has 

for five distinct components. These five components include: helped to increase the 

student growth, observations, professionalism, student surveys, and 

teacher absenteeism. number of hours that 

Includes “teacher absenteeism” as a specific metric: Nationwide, New students are served by 

Mexico is the only state to specifically require that teacher absenteeism fully qualified teachers. 

be included as part of a teacher's summative evaluation rating. Under 


this component, comprising five percent of a teacher's overall rating, ad Montafio, as Deputy 
each New Mexico district submits teacher absences to the New lena pe diy el aca 
Mexico Public Education Department. Absences due to the Family Education Department 


and Medical Leave Act, bereavement, ury duty, miltary leave, religious 
leave, professional development, or coaching are excused. In addition, 
there is a grace period of up to six days, meaning that ia district reports 
six days absent for a teacher, the teacher earns 100 percent of the 
possible points for attendance. Ifa district reports more than six days 
absent for a teacher, however, all ofthat teachers absences are included 


in her evaluation. 
Example of possible calculations: 
# of absences Weight Possible Summative 
reported summative points points earned 
Teacher A 0) a 10 10 
Teacher B 19 0.05 10 0.5 
“Teacher C 25 0 10 to) 
Teacher D 6.5 0.675 10 6.75 


Is fully integrated into New Mexico's teacher preparation, licensure, and support systems: 


= Teacher preparation program accountability: New Mexico requires educator preparation programs to collect and 
report data on the performance and effectiveness of program graduates, as measured by student growth data. 

» Licensure advancement/renewal: To advance in their licensure, teachers must demonstrate evidence of teacher 
effectiveness and student learning, which may be demonstrated by earning at least 50 percent of the possible 
points in the improved student achievement domain of their effectiveness report. To renew their licenses, teachers 
must demonstrate how they meet the competencies and indicators for their licensure level through their annual 
evaluation. 
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Opportunities for improvement 


= New Mexico's defiition®? for “ineffective teacher” under the Every 
Student Succeeds Act (ESSA) includes not only any teacher who 
earns an ineffective rating under its teacher evaluation system, but 
also any teacher who earns a student growth rating in the bottom 
decile statewide. This definition supports New Mexico's work to ensure 
that its students — including and particularly the state's lowincome 
students and students of color — have access to teachers with the 
demonstrated ability to increase student learning and growth. In 
doing so, it enables New Mexico to better serve some of its most 
vwulnerable students. As it moves forward with its teacher evaluation 
system implementation, New Mexico should fully integrate its rigorous 
definition for “ineffective teacher” under the ESSA into its teacher 
evaluation system. This policy change would help ensure that any 
teacher who earns student growth ratings in the bottom decile 
statewide earns an ineffective evaluation rating and has access to 
the necessary resources and supports to improve her practice. 


Notable changes to system since inception 


= When New Mexico's evaluation system was first implemented, student 
growth accounted for 50 percent of a teacher's evaluation rating. 
As of the 2017-2018 school year, based on recommendations from 
New Mexico's teachers and leaders, student growth is currently 
weighted at 35 percent.®® 


EVIDENCE OF IMPACT 


Provides a more valid measure of teacher quality? 
Yes, teachers in New Mexico earn ratings that are substantially 
more widely distributed across NMTEACHSs rating categories than 


@ 


“We are able to distinguish 
teachers at five levels with 
a greater understanding of 
how they're impacting student 
achievement.” 


— Matthew Montafio, former Deputy 
Cabinet Secretary of Teaching and 
Learning at the New Mexico Public 

Education Department= 


“We have stuck to the 
premise that student 
achievement is not just 
a part of the evaluation 
system, but an extremely 
important part of the 
evaluation system.” 


— Matthew Montafio, former Deputy 
Cabinet Secretary of Teaching and 
Learning at the New Mexico Public 

Education Department?® 


teachers in other states,° which enables New Mexico to differentiate the resources and supports available to teachers 


with differing levels of effectiveness. 
New Mexico teacher ratings between 2014-2018" : 
60 - 


50 - 


40 - 
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. || 
onion 


iil. 


Ineffective Minimally effective Effective 
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The percentage of teachers rated below effective among 24 states:** 


State Percentage State Percentage 
New Mexico, 28.7 Maryland 2.6 
Oregon 11.7. Michigan 24 
Tennessee 114 North Carolina 23 
Kansas 111 Florida 2.2 
Arizona 7.0 Idaho | 2.2 
Louisiana 6.0 Indiana 2.2 
Massachusetts 49 Georgia 2.0 
New York 49 Delaware 19 
Colorado 4.2 Rhode Island 17 
Connecticut 4.0 New Jersey 1.6 
Ohio 3.7 Pennsylvania 14 
Washington 3.5 Hawaii 0.7 


New Mexico's ability to track the percentage of students taught by teachers earning different evaluation ratings is 
critically important in enabling New Mexico policymakers to make progress in closing its achievement and opportunity 
gaps. During the 2016-2017 school year, nearly 77 percent of New Mexico's students of color were taught by teachers 
rated effective, highly effective, or exemplary. In fact, more students of color in New Mexico — both a higher number 
and a higher percentage — are being taught by highly effective and exemplary teachers than nonminority students. 


Helps all teachers improve? 


Yes, New Mexico's system allows the state to give teachers precise 
insight into areas of improvement. The state's teachers receive their 
evaluation ratings as early as August, and these evaluation ratings 
are directly inked to professional development opportunities. Between 
2015 and 2018, over 1,000 more teachers earned exemplary and 
highly effective ratings under NMTEACH.!2 


Correlates with increased student achievement? 


Yes, between 2015 and 2018, 11,000 more students demonstrated 
grade-level proficiency in math and 13,000 more students are reading 
on grade level — with Native American students improving their 
reading results more than any other group of students, by 8.2 
percentage points.%? Note that these gains cannot be tied directly 
to the evaluation system without a controlled study. 


“Our Teacher of the Year 
(TOY) is selected with 
NMTEACH. Therefore, we 
know our TOY will not have 
simply won a series of 
popularity contests. This 
past year’s recipient, Ivonne 
Orozco, popped on her 
NMTEACH evaluation, and 
everyone who has gotten to 
interact with her can attest 
to her excellence.” 


— Chris Eide, Director of Educator 
Quality at the New Mexico Public 
Education Department?” 
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Tennessee Department of Education (TNDOE) 
Evaluation System: The Tennessee Educator Acceleration Model (TEAM) 


History: Implemented in the 2011-2012 school year under Governor Bill Haslam and Commissioner Kevin 
Huffman and continues now under Governor Bill Haslam and Commissioner Dr. Candice McQueen. 

How it works: Teachers earn one of five ratings. 

Impact on salary: Tennessee provides local districts with a minimum salary schedule that sets a base salary with 


raises earned for each year of experience and type of degree. Distinguishing Tennessee's salary 
schedule from other states is its requirement that districts must also differentiate teacher 
compensation based on at least one of the following criteria: additional roles or responsibilities, 
hard-to-staff schools or subject areas, and performance based on teacher evaluations. 


Minimum number of years to achieve highest salary: After 11 years, Tennessee's state salary 
schedule provides that teachers with a bachelor’s degree will earn $40,595 and those with an ad- 
vanced degree will earn $45,075. These amounts may be supplemented by Tennessee's districts. 


REFLECTING ON SYSTEM 
Notable features 
1 Requires a high number of observations, especially during the frst ~“Qur evaluation system has 


half of the school year. A teacher's prior year performance and 
license type determine number of observations. Most new teachers 
are observed six times and most veteran teachers are observed 
four times — with half of these observations occurring during the first 
haif of the school year. Highly rated teachers are formally observed 
once but also supported by multiple, short walkthroughs. 


Is fully integrated into Tennessee's teacher preparation, licensure, 
support, and dismissal systems, with these particularly noteworthy 
applications: 


Teacher preparation program accountability: Tennessee requires 
educator preparation programs to collect and report data on the 
performance and effectiveness of program graduates, as measured 
by student growth data. 

Tenure: A tenured teacher who receives one of the two lowest 
ratings may be reverted to probationary status until earning a 
higher rating two years in a row. 

Dismissal: Unike 28 states, Tennessee explicitly makes teacher 
ineffectiveness grounds for dismissal and also requires that 
teacher evaluation system ratings serve as sufficient evidence of 
ineffectiveness. 


been a consistent approach 
of developing the capacity 

of teachers to improve. The 
state’s TEAM model provides 
a common language and the 
vocabulary of continuous 
improvement. In fact, more 
than 70 percent of educators 
believe that the TEAM model 
has positive impacts for both 
teachers and students.” 


— Paul Fleming, Tennessee's Assistant 
Commissioner of Teachers and Leaders!!! 


Independently evaluates the system to determine efficacy. In 2016, Tennessee partnered with Vanderbilt University 


to form a research alliance (Tennessee Education Research Alliance (TERA\), with the goal of building Tennessee's 
capacity for continual improvement through independent studies and external research that provide valuable information to 


state policymakers. °° 


Opportunities for improvement 


= Despite the strengths associated with Tennessee's system, it continues to face implementation challenges. Spring 2018 
marked the second time in three years that there were significant online testing disruptions of the state standardized 
assessment, riamely login delays and connectivity issues, which led the legislature to delay any application of evaluation 


results to determine employment or compensation. 
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BUILDING BUY-IN 


Steps to build buy-in 
In response to feedback from educators during the first year ofits evaluation system implementation, Tennessee established 
a centralized communication strategy to provide teachers and school leaders with the information necessary to 
successfully implement this system, as well as to learn from educators’ implementation experiences. As part of this 
process, Tennessee met with more than 7,500 educators and considered feedback — surveys, stakeholder meetings, 
and email communications — from thousands more. In addition, then-Commissioner Huffman visited more than 100, 
districts to discuss evaluation system implementation with educators. 


Surveys of teachers and administrators 
According to the recently released 2018 Tennessee Educator Survey, 72 percent of teachers believe the teacher 
evaluation process used at their school has led to improvements in their teaching. This is up from 38 percent in 
2012. Further, 69 percent of teachers believe the teacher evaluation process used at their school has led to 
improvements in student learning. This is up from 28 percent in 2012. Also, 53 percent of teachers surveyed believe 
that the feedback received from evaluators was focused more on helping teachers improve, rather than making 
judgments about performance." 


The 2018 Tennessee Educator Survey indicates that 92 percent of administrators feel that they receive useful feedback 
as a result of the evaluation process, and 91 percent report making changes to their leadership practices as a result of 
evaluations. Overall, 87 percent of administrators report being satisfied with the evaluation process.°* 


EVIDENCE OF IMPACT 


Recognizes strong teachers and keeps them in the classroom? : 
Yes, Tennessee's teacher retention data demonstrate that teachers “On organizational theory, 
earning highly effective ratings are generally retained at a higher F 
rate than less effective teachers. Specifcaly, in the 20122013 ‘(tS really important to be 
school year, more than 90 percent of teachers earning one of the Clear on the non-negotiables 
three highest ratings in Tennessee's teacher evaluation system were and then provide intentional 
retained, whereas approximately 80 percent of teachers earning the : 
lowest rating were retained. and intensive supports. Take 

Helps all teachers improve? the time on the front end to 
Yes, a recent report" by TERA found that teacher improvement in. — ayelop the rationales so that 
Tennessee appears steeper in recent years and coincides with Tennessee's ioe : ee 
significant investment in efforts to improve teacher professional learing. districts appreciate that this is 
These investments include Tennessee's development and implementation being done with them, rather 
ofits teacher evaluation system inthe 20112012 school year. Speciicaly, than to them.” 
researchers found that teacher improvement” in the years subsequent J 
to the implementation of Tennessee's teacher evaluation system, — Paul Fleming, Tennessee’s Assistant 
from 2013 to 2015, was ‘much more rapid and sustained’ than Commissioner of Teachers and Leaders!#? 
teacher improvement between 2008 and 2010..° 

Correlates with increased student achievement? 

Yes, for the 2016-2017 school year, 55 school districts, equaling 

more than a third of Tennessee's districts, earned overall Tennessee Value-Added Assessment System (TVAAS) com- 
posites representing the highest level of growth. This includes districts with a wide range of academic achievement 
and student demographics. Further, fewer districts reported the lowest two categories of growth during this same 
year.” These data represent an improvement over the 2014-2015 and 2015-2016 school years, during which 46 
and 52 districts, respectively, earned overall TVAAS composites representing the highest level of growth."!© Note that 
these gains cannot be tied directly to the evaluation system without a controlled study. 
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Appendix 


Key components of an evaluation system 


Multiple measures for final evaluation score: Many factors should be considered in formally evaluating a teacher, with 
teacher evaluation instruments, including factors that combine both human judgment and objective measures of student 
learning. The Measures of Effective Teaching (MET) project found that multiple measures — namely classroom observations, 
student surveys, and measures of student achievement — produce more consistent, stable ratings over the years, as 
compared to a single measure of effectiveness." 


Student surveys: The use of student survey data can provide a rich picture of teacher effectiveness in the classroom 
and further strengthen an evaluation systems ability to identify teachers’ effectiveness.""* Research finds not only that 
student input on teacher quality adds value to teacher evaluation systems, but also that teachers often prefer evaluation 
systems that include student survey data." Further, teacher evaluation systems that include student survey data, which 
are somewhat correlated with teachers’ student growth measures, are stronger, more reliable, and more valid than those 
that rely solely on administrator reports and observations.= 


Objective measures of student growth: Teacher evaluation systems should include objective measures of student 
learning, in addition to factors that rely on informed, subjective judgments of teacher performance. Evidence shows that 
teachers who increase their students’ learning positively influence those students’ long-term achievements (e.g., higher 
likelihood of attending college, earning higher salaries), in addition to benefiting their immediate academic outcomes.” 
Although teachers have many responsiblities, advancing their students’ academic achievement is one of every teacher's 
primary goals. 


At least three rating categories: Evaluation instruments that differentiate among various levels of teacher performance, 
rather than those with binary satisfactory/unsatisfactory ratings, further increase the utlty and validity of evaluation systems. 
Traditionally, binary rating systems have offered litle meaningful information because virtually all teachers received satisfactory 
ratings.""8 More rating categories allow for more nuanced distinctions among levels of teacher performance. 


Annual observations and evaluations for all teachers: The aim of teacher evaluations is to help all teachers improve, 
and therefore all teachers need feedback on their performance every year. Observations serve several purposes, including 
providing actionable feedback to teachers and informing a summative rating that can be used in staffing decisions. Observations 
are a rich source of information for teachers, giving them useful feedback to improve their practice. Because a teacher's 
effectiveness can vary from year to year, multiple years of data (e.g., two years of value-added measures (VAM) data) 
should be used in determining a teacher's summative evaluation rating to increase accuracy.!!? 


Professional development tied to evaluations: A primary purpose of teacher evaluations is to identify areas in which 
a teacher is performing well and areas in which she needs to improve. However, simply naming these areas is ikely insufficient. 
States and districts should also provide professional development that is tailored to a teacher's specific needs. A 
landmark study of professional development found that, commonly, teachers do not get clear information about how to 
improve, nor do they feel that their professional development is customized based on their needs.12 
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Written feedback after each observation: Constructive, detailed feedback and coaching is a critical component of teach- 
er development. Requiring feedback in written form (ideally accompanied by a conference between the teacher and 
observer) can help ensure that observers give deliberate thought to the feedback they impart and provide teachers with 
guidance that they can revisit as needed.'® Teachers find regular feedback helpful for improving their instructional prac- 
tice, and they are more likely to believe that the evaluation system is improving their practice when they receive frequent 
observations and feedback.'* 


Compensation tied to evaluations: Teachers’ salaries can be a tool to recruit and retain teachers and to incentivize or 
reward certain behaviors.'® Traditional salary schedules tend to favor years of experience (which generally correspond 
with increased effectiveness only in a teacher's early years)!* and level of education (which, except in rare cases like 
graduate degrees in mathematics, are unrelated to a teacher's effectiveness)."®” Tying compensation to effectiveness, 
instead of years of experience and attainment of a graduate degree, can create a system that better aligns with a district's 
values and may be a means of persuading more effective teachers to stay in the district. Additionally, not giving salary 
increases to ineffective teachers sends an important signal about the significance of teacher effectiveness. 
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